Self-organized Model for Modular Complex Networks : Division and Independence 
We introduce a minimal network model which generates a modular
structure in a self-organized way. To this end, we modify the
Barabasi-Albert model into one that evolves under the principle of
division and independence as well as growth and preferential
attachment (PA). A newly added vertex chooses one of the modules
composed of existing vertices, and attaches edges to vertices
belonging to that module following the PA rule. When a module reaches
a prescribed size, it is divided into two, and a new module is
created. The karate club network studied by Zachary is a prototypical
example. We find that the model successfully reproduces the behavior
of the hierarchical clustering coefficient of a vertex with degree k,
C(k), in good agreement with empirical measurements of real world
networks.

PACS numbers: 89.65.-s, 89.75.Hc, 89.75.Da

Recently, considerable effort has been made to understand complex
systems in terms of random graphs, consisting of vertices and edges
[1, 2, 3, 4]. Such complex networks exhibit many interesting emerging
patterns, as follows. First, the degree distribution follows a power
law, P(k) ~ k^{-γ}, where the degree k is the number of edges
connecting to a given vertex [5]. Such networks, called scale-free
(SF), are ubiquitous in the real world. To illustrate such SF behavior
in the degree distribution, Barabasi and Albert (BA) [5] introduced an
in silico model: Initially, m fully connected vertices exist in a
system. At each time step, a vertex is newly added and connects to m
existing vertices, which are chosen with a probability linearly
proportional to the degree of the target vertex. Such a selection rule
is called the preferential attachment (PA) rule.
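
As a rough illustration of the PA rule just described, the following Python sketch grows a BA-type graph. The function name `ba_graph`, the adjacency-dictionary representation, and the choice to redraw until m distinct targets are found are assumptions made for this example; the text does not specify them.

```python
import random

def ba_graph(n_total, m, seed=None):
    """Grow a BA-type graph; returns adjacency as dict[int, set[int]]."""
    if m < 2:
        raise ValueError("this sketch needs m >= 2 so that the seed has edges")
    rng = random.Random(seed)
    # Start from m fully connected vertices, as in the text.
    adj = {v: set(range(m)) - {v} for v in range(m)}
    # Each vertex appears in `stubs` once per incident edge, so a uniform
    # draw from `stubs` picks a vertex with probability proportional to degree.
    stubs = [v for v in adj for _ in adj[v]]
    for new in range(m, n_total):
        targets = set()
        while len(targets) < m:      # assumption: the m targets are distinct
            targets.add(rng.choice(stubs))
        adj[new] = set(targets)
        for t in targets:
            adj[t].add(new)
            stubs.extend((new, t))   # one stub per edge end
    return adj
```

Keeping a list of edge ends avoids recomputing degree weights at every step, which is a common way to implement degree-proportional selection.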

Secondly, many real world networks have modular structures within
them. Modular structures form geographically in the Internet [6],
functionally in metabolic [7] or protein interaction networks [8], or
following social activities in social networks [9, 10]. In these
modular complex networks, the hierarchical clustering coefficient of a
vertex with degree k, denoted by C(k), behaves as C(k) ~ k^{-β} [11],
where the clustering coefficient is defined as the ratio of the number
of triangles connected to a given vertex to the number of triples
centered on that vertex. Also, the clustering coefficient averaged
over all vertices is independent of the system size N. In the BA
model, however, C(k) is independent of k but depends on N [2, 11],
because the BA model does not contain modules. To understand the
behavior of C(k), a deterministic hierarchical model was introduced by
Ravasz and Barabasi [11], in which C(k) ~ k^{-1} and the clustering
coefficient C is independent of N [12]. While it is important to
understand the mechanism for the formation of such modular structure
through an in silico model, few models have been studied, and none in
which the modules were generated in a self-organized way. Thus the
goal of this paper is to introduce such a model.
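
The definition of the clustering coefficient given above (triangles through a vertex divided by triples centered on it) can be sketched in Python as follows. The function names and the convention that vertices with fewer than two neighbors get a coefficient of 0 are assumptions of this example, not statements from the text.

```python
from collections import defaultdict
from itertools import combinations

def local_clustering(adj, v):
    """Triangles through v divided by triples (neighbor pairs) centered on v."""
    nbrs = adj[v]
    k = len(nbrs)
    if k < 2:
        return 0.0  # assumed convention: no triples means a coefficient of 0
    triangles = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
    return triangles / (k * (k - 1) / 2)

def clustering_by_degree(adj):
    """Return {k: C(k)}, the mean local clustering over vertices of degree k."""
    buckets = defaultdict(list)
    for v in adj:
        buckets[len(adj[v])].append(local_clustering(adj, v))
    return {k: sum(vals) / len(vals) for k, vals in buckets.items()}

# Small example: a triangle (0, 1, 2) with a pendant vertex 3 attached to 0.
example = {0: {1, 2, 3}, 1: {0, 2}, 2: {0, 1}, 3: {0}}
print(clustering_by_degree(example))
```

In the example, vertex 0 has three neighbor pairs of which one is connected, so its coefficient is 1/3, while vertices 1 and 2 each have a coefficient of 1.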

Thirdly, the degree-degree correlation in real world networks is
nontrivial. The nontrivial behavior is measured in terms of the mixing
coefficient r [13], a Pearson correlation coefficient between the
remaining degrees of the two vertices on each side of an edge, where
the remaining degree means the degree of that vertex minus one.
Complex networks can be classified according to the mixing coefficient
r into three types, having r < 0, r ≈ 0, and r > 0, called the
dissortative, the neutral, and the assortative network, respectively
[13]. An assortative or dissortative network can also be identified by
a quantity, denoted by ⟨k_nn⟩(k), the average degree of a neighboring
vertex of a vertex with degree k [14]. For the assortative
(dissortative) network, ⟨k_nn⟩(k) increases (decreases) with
increasing k, i.e., a power law ⟨k_nn⟩(k) ~ k^{-ν} is satisfied, where
ν is negative (positive) for the assortative (dissortative) network
[14].
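
The two correlation measures described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the graph is an undirected adjacency dictionary, and every edge is counted once in each direction when forming the Pearson coefficient.

```python
from collections import defaultdict

def mixing_coefficient(adj):
    """Pearson correlation of remaining degrees at the two ends of each edge."""
    xs, ys = [], []
    for u in adj:
        for v in adj[u]:             # each edge is visited in both directions
            xs.append(len(adj[u]) - 1)
            ys.append(len(adj[v]) - 1)
    n = len(xs)
    mean = sum(xs) / n               # xs and ys share the same mean and variance
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys)) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return cov / var                 # undefined when all degrees are equal

def knn_by_degree(adj):
    """Return {k: <k_nn>(k)}, the mean neighbor degree of vertices of degree k."""
    buckets = defaultdict(list)
    for v, nbrs in adj.items():
        if nbrs:
            buckets[len(nbrs)].append(sum(len(adj[u]) for u in nbrs) / len(nbrs))
    return {k: sum(vals) / len(vals) for k, vals in buckets.items()}

# A star graph is the extreme dissortative case: the hub links only to leaves.
star = {0: {1, 2, 3, 4}, 1: {0}, 2: {0}, 3: {0}, 4: {0}}
print(mixing_coefficient(star), knn_by_degree(star))
```

For the star, ⟨k_nn⟩(k) falls from 4 at k = 1 to 1 at k = 4, the decreasing trend that the text associates with a dissortative network.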

In this paper, we are interested in modelling modular complex
networks, in particular those forming in a self-organized way. In
social networks, modules represent the communities each individual
belongs to, which may evolve as time passes. The karate club (KC)
network, originally proposed by Zachary [15], is an example of a
social network containing community structures. Recently, Newman and
Girvan [9] studied the KC network to test a new algorithm for
clustering communities [9, 16]. Here we notice that the KC network
contains an important ingredient, division and independence, needed
for the formation of modular structure, in addition to the growth and
PA principles noticed in the BA model. Thus we introduce a network
model evolving by such principles, and perform numerical simulations
for large system size. Indeed, we find that the model exhibits a
characteristic feature of modular structure, C(k) ~ k^{-1}, as
observed in empirical data.

FIG. 1: A snapshot of the model network with parameters N = 34,
m_0 = 4 and n = 17, looking similar to the karate club network
proposed by Zachary. Here two groups are identified by (o) and (•).

To be specific, the main dynamic process of the evolution 
of the KC network is as follows. In a KC, a disagreement de- 
velops between the administrator of the club and the club's in- 
structor as time goes on, ultimately resulting in the instructor 
leaving (division) and founding a new club (independence), 
accompanied by about half the original club's members. This 
generic feature of division and independence can be observed 
in many other social communities such as schools, companies, 
churches, clubs, parties, etc. For example, in the coauthorship
network, a graduate student publishes papers with her/his thesis
advisor, so that they are connected in the coauthorship network. When
she/he graduates and becomes a professor at another school (division),
she/he also gets her/his own students, creating a new group
(independence).

To model the evolution of the KC network, we modify the 
BA model by assigning a color to each vertex. The color as- 
signed to each vertex indicates the group the vertex belongs 
to. The dynamic rule of our model is as follows: 

(i) BA model (Growth and PA) : Initially, there exist m_0 vertices.
They are fully connected. Each vertex i is assigned the same index of
color μ_i = 1. Thus the total number of distinct colors is q = 1. At
each time step, a vertex is introduced and connects to m existing
vertices following the PA rule. Here m is not fixed, but is
distributed uniformly among the integers in the range [1, m_0]. The
new vertex j is also assigned the index of color μ_j = 1, and this
process is repeated until the number of vertices reaches n, a cutoff
of the group size. This process defines the first group, q = 1.

(ii) Division and independence : Then we identify the two vertices i
and j in the group q with the largest and the second largest degree,
respectively, for division and independence. The vertex j declares
independence and changes its color to a new one, i.e., μ_j = q + 1.
Then each remaining vertex k (≠ i, j) in the group having the same
color as vertex i measures the distances d(k, i) and d(k, j) to the
vertices i and j, respectively. If d(k, i) < d(k, j), the vertex k
keeps its index of color as it is; otherwise, it changes its index of
color to that of j. The system then comprises q + 1 different groups,
and we update q + 1 → q, by definition. So the newest group has the
new color q.

(iii) Growth and PA again : If q > 1, then a newly added vertex ℓ
chooses one of the q colors, say μ_ℓ, with equal probability, and m,
the number of outgoing links, also randomly from the integers
1, . . . , m_0. Then m existing vertices are chosen in the group with
the color μ_ℓ following the PA rule, and m edges are inserted between
them and the new vertex. This process is repeated until the number of
vertices of any group reaches n again. After that, we repeat the step
of division and independence (ii) in that group only.
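
Rules (i) to (iii) above can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, PA weights use a vertex's total degree, distances are shortest-path lengths on the whole network, the m targets are distinct (m is capped at the group size), and ties in degree are broken arbitrarily.

```python
import random
from collections import deque

def bfs_dist(adj, src):
    """Shortest-path distances from src to every reachable vertex."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist

def divide(adj, color, groups, c):
    """Rule (ii): split group c around its two highest-degree vertices."""
    members = sorted(groups[c], key=lambda v: len(adj[v]), reverse=True)
    i, j = members[0], members[1]
    di, dj = bfs_dist(adj, i), bfs_dist(adj, j)
    new_c = max(groups) + 1          # the new color q + 1
    keep, leave = [i], [j]
    for k in members[2:]:
        # k keeps its color only if it is strictly closer to i than to j.
        (keep if di[k] < dj[k] else leave).append(k)
    groups[c], groups[new_c] = keep, leave
    for v in leave:
        color[v] = new_c

def grow_modular_network(N, m0, n, seed=None):
    """Return (adj, color) after growing the network to N vertices."""
    if m0 < 2 or n <= m0:
        raise ValueError("this sketch assumes 2 <= m0 < n")
    rng = random.Random(seed)
    # Rule (i): m0 fully connected vertices, all with color 1.
    adj = {v: set(range(m0)) - {v} for v in range(m0)}
    color = {v: 1 for v in range(m0)}
    groups = {1: list(range(m0))}
    for new in range(m0, N):
        c = rng.choice(list(groups))             # each color equally likely
        members = groups[c]
        m = min(rng.randint(1, m0), len(members))
        weights = [len(adj[v]) for v in members]  # PA inside the chosen group
        targets = set()
        while len(targets) < m:
            targets.add(rng.choices(members, weights=weights)[0])
        adj[new] = set(targets)
        for t in targets:
            adj[t].add(new)
        color[new] = c
        members.append(new)
        if len(members) >= n:                    # group reached the cutoff n
            divide(adj, color, groups, c)
    return adj, color

adj, color = grow_modular_network(34, 4, 17, seed=0)
print(len(adj), sorted(set(color.values())))
```

The call at the end uses the parameter values quoted for FIG. 1 (N = 34, m_0 = 4, n = 17); the number of groups it produces depends on the random seed.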

The network constructed in this way is shown in FIG. 1 
based on the same number of vertices as the empirical data of 
the KC network. The structure of the model is different from 
the BA model due to the presence of modular structure. Note 
that in our model, one vertex may transfer from one group to 
another as time goes on, that is, a vertex can change its color as 
it transfers to a new group. This characteristic is different from 
that of the q-component static model proposed by the current 
authors [17], where each individual belongs concurrently to q
different groups such as high school alumni, college alumni, 
company, etc. Those two models may reflect different aspects 
of our social community. 

Based on the empirical data by Zachary, we obtain topological
properties of the KC network, which are listed in TABLE I and FIG. 2.
Until now, it has been believed that social networks are generally
assortative [13, 18]. But in "division and independence" social
networks such as the KC network, each element is connected to the
others in a hierarchical way, without any mediator, leading to a
dissortative network, as shown in TABLE I and FIG. 2. Since different
colors represent distinct modules [7, 10] or communities [9],
connections are very tight. Thus it is expected that the clustering
coefficient C is non-trivially large [18]. TABLE I shows the
dissortativity and the highly-clustered nature of the KC network and
our model. The agreement between the two is excellent except for the
mixing coefficient r. Note that the r value of the model is not close
to zero although we used the BA-type random attachment rule. It should
be noted that the large value of C is obtained in a self-organized
way. FIG. 2 shows that
the degree distribution, P(k) ~ k^{-2.?}, the hierarchical clustering
coefficient, C(k) ~ k^{-1.0}, and ⟨k_nn⟩(k) ~ k^{-?.5} of the KC
network are also in good agreement with those obtained from the
present model network. Such agreement indicates that our simple model
captures the essential topology of the KC network.

More generally, we investigated the topological properties of our
model network for large N with various n. In FIG. 3, we consider the
case of N = 10000, m_0 = 4, and n = 500. FIG. 3(a) shows the degree
distribution of our model. It seems that P(k) follows a power law with
the exponent γ ≈ 3.5, but that there exists plateau behavior for large
k. The plateau for large k is caused by the artificially uniform
cutoff of the group


Name         N     ⟨k⟩     d       r                C
Zachary's    34    4.59    2.41    -0.48            0.59
Ours         34    4.61    2.54    -0.19 (-0.22)    0.56



TABLE I: Mean degree ⟨k⟩, the diameter d, the assortativity
coefficient r, and the clustering coefficient C obtained from
Zachary's KC network and from ours with parameters N = 34, m_0 = 4 and
n = 17. All the numerical values for the model are averaged over ten
configurations. Note that Zachary presumed that the edge between the
administrator and the instructor of the club no longer holds upon
division and independence. Following Zachary's approach, we obtain
r = -0.22 in our model.

FIG. 2: Plots of the cumulative degree distribution P_cum(k) (a), the
clustering coefficient C(k) (b), and ⟨k_nn⟩(k) (c) versus degree k. In
all panels, the empirical data and the data from the model are denoted
by (o) and (•), respectively. The parameters for the model network are
the same as used in FIG. 1. Lines are drawn as a guide to the eye.



FIG. 3: Plots of the degree distribution P(k) (a), the clustering
coefficient C(k) (b), and ⟨k_nn⟩(k) (c) versus degree k. The data in
all figures are obtained with parameters N = 10000, m_0 = 4 and
n = 500. In this case we obtain the mean degree ⟨k⟩ = 4.98, the
diameter d = 4.87, the assortativity coefficient r = -0.24, and the
clustering coefficient C = 0.42.



size n, which should be modified to fit the empirical data, if
available. If n is not uniform, but is made stochastic following, for
example, a power law, then the shape of the plateau would change
accordingly. FIG. 3(b) shows the hierarchical clustering coefficient
C(k) behaving as ~ k^{-1.0}, which is in good agreement with the
Ravasz-Barabasi model [11]. FIG. 3(c) shows ⟨k_nn⟩(k), which exhibits
dissortative behavior. The exponent ν is somewhat different from the
one measured in the small network in FIG. 2(c), because the size
N = 34 in FIG. 2 may be too small to measure the exponent ν, as can be
seen at small k in FIG. 3(c). Also, a plateau region occurs for large
k in ⟨k_nn⟩(k). The dissortative behavior (ν > 0) is caused by
hierarchical organization inside a group.
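
The text does not say how the exponents were estimated. One simple possibility, shown here purely as an assumption, is a least-squares fit of log y against log x over the scaling regime; the helper name `loglog_slope` is hypothetical.

```python
import math

def loglog_slope(points):
    """Least-squares slope of log(y) against log(x) for pairs with x, y > 0."""
    logs = [(math.log(x), math.log(y)) for x, y in points if x > 0 and y > 0]
    n = len(logs)
    mx = sum(x for x, _ in logs) / n
    my = sum(y for _, y in logs) / n
    sxx = sum((x - mx) ** 2 for x, _ in logs)
    sxy = sum((x - mx) * (y - my) for x, y in logs)
    return sxy / sxx

# Synthetic check: data following C(k) = 2 / k have a log-log slope of -1.
synthetic = [(k, 2.0 / k) for k in range(2, 50)]
print(loglog_slope(synthetic))
```

On measured data the fit should be restricted to the power law regime, since the plateau at large k described above would bias the slope.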

FIG. 4 shows the n-dependence of the hierarchical clustering
coefficient C(k). When n is very small with respect to the network
size N, C(k) behaves as ~ k^{-1.0}, but as n increases to N, C(k)
deviates from the power law C(k) ~ k^{-1.0}. The n = 10 case shows
clear power law behavior. For n = 100, a scattered behavior occurs in
the middle of the power law regime. This is found in the actor network
(FIG. 3(a) of Ref. [11]). For n = 500, many points are scattered in
the middle of the power law regime, which is similar to the empirical
results from the Internet autonomous system (FIG. 3(d) of Ref. [11]).
For n = 1000, most of the data points for C(k) are scattered more
diversely with k, which is similar to the results from the World Wide
Web (WWW) (FIG. 3(c) of Ref. [11]). When the group size n approaches
the network size N, C(k) of our model reduces to that of the BA model
(FIG. 2(b) of Ref. [11]).

FIG. 4: The clustering coefficient C(k) versus degree k obtained with
the parameters N = 10000, m_0 = 4, and n = 10 and 100 (a), n = 500
(b), n = 1000 (c) and n = 10000 (d).

FIG. 5: The clustering coefficient C(k) versus degree k obtained under
two selected conditions with large values of m_0.

FIG. 5 shows the m_0-dependence of the hierarchical clustering
coefficient C(k). For large m_0 values, we can see clearly that C(k)
has both a plateau regime from k = 2 to k ≈ m_0 and a power law regime
satisfying C(k) ~ k^{-1.0} beyond that degree. When m_0 approaches the
group size n, such as when m_0 = 15 and n = 20, i.e., when vertices
inside one module are nearly fully connected, such a plateau with a C
value near 1.0 appears. We can thus say that the actor and language
networks (FIG. 3(a) and 3(b) of Ref. [11]) have modules composed of
nearly fully connected vertices. Our model can thus explain most of
the hierarchical clustering structures of real world networks
qualitatively well, when the two parameters m_0 and n are properly
selected. As an example, the case of m_0 = 10 and n = 200 in FIG. 5
shows a plateau regime as well as a scattered behavior in the middle
of the power law regime, which are very similar to the actor network
(FIG. 3(a) of Ref. [11]).

In conclusion, we have generalized the BA model by assigning a color
to each vertex for the purpose of modelling modular complex networks
in a simple way. The model evolves with time under the principle of
division and independence, in a manner reminiscent of the KC network.
Through this model, we confirmed the behavior of the hierarchical
clustering coefficient, which is in accordance with the ones obtained
from the deterministic hierarchical structure and from empirical data
such as the Internet, the WWW, and the actor networks [11]. Also, it
was found that our model exhibits a dissortative mixing behavior, as
observed in the KC network. Our model can be modified in various ways,
for example, by diversifying the group size cutoff n, to fit real
world networks. Finally, we suggest that the principle of division and
independence could be used in constructing modular complex networks in
various fields, for example, bio-complex networks, where the strong
mutation of a gene may correspond to transferring from one group to
another [19].

This work is supported by the KOSEF Grant No. R14- 
2002-059-01000-0 in the ABRL program, Korea and by the 
Royal Society, London. 



[1] S. H. Strogatz, Nature 410, 268 (2001).
[2] R. Albert and A.-L. Barabasi, Rev. Mod. Phys. 74, 47 (2002).
[3] S. N. Dorogovtsev and J. F. F. Mendes, Adv. Phys. 51, 1079 (2002).
[4] M. E. J. Newman, SIAM Review 45, 167 (2003).
[5] A.-L. Barabasi and R. Albert, Science 286, 509 (1999).
[6] K. A. Eriksen, I. Simonsen, S. Maslov, and K. Sneppen, Phys. Rev. Lett. 90, 148701 (2003).
[7] E. Ravasz, A. L. Somera, D. A. Mongru, Z. N. Oltvai, and A.-L. Barabasi, Science 297, 1551 (2002).
[8] A. W. Rives and T. Galitski, Proc. Natl. Acad. Sci. USA 100, 1124 (2003).
[9] M. Girvan and M. E. J. Newman, Proc. Natl. Acad. Sci. USA 99, 8271 (2002); cond-mat/0308217; cond-mat/0309508.
[10] R. Guimera, L. Danon, A. Diaz-Guilera, F. Giralt, and A. Arenas, cond-mat/0211498.
[11] E. Ravasz and A.-L. Barabasi, Phys. Rev. E 67, 026112 (2003).
[12] J. D. Noh, Phys. Rev. E 67, 045103(R) (2003).
[13] M. E. J. Newman, Phys. Rev. Lett. 89, 208701 (2002); Phys. Rev. E 67, 026126 (2003).
[14] R. Pastor-Satorras, A. Vazquez, and A. Vespignani, Phys. Rev. Lett. 87, 258701 (2001).
[15] W. W. Zachary, J. Anthropol. Res. 33, 452 (1977).
[16] H. Zhou, Phys. Rev. E 67, 041908 (2003); ibid. 67, 061901 (2003).
[17] D.-H. Kim, B. Kahng, and D. Kim, cond-mat/0307184.
[18] M. E. J. Newman and J. Park, Phys. Rev. E 68, 036122 (2003).
[19] K.-I. Goh, B. Kahng, and D. Kim, (unpublished).



